FETI Coarse Problem Parallelization Strategies and Their Comparison
نویسندگان
چکیده
Most of computations (subdomain problems) appearing in FETI-type methods are purely local and therefore parallelizable without any data transfers. However, if we want to accelerate also dual actions, some communication is needed due to primal-dual transition. Distribution of primal matrices is quite straightforward. Each of cores works with local part associated with its subdomains. A natural effort using the massively parallel computers is to maximize the number of subdomains so that sizes of subdomain stiffness matrices are reduced which accelerates their factorization and subsequent pseudoinverse application, belonging to the most time consuming actions. On the other hand, a negative effect of that is an increase of the null space dimension and the number of Lagrange multipliers on subdomains interfaces, i.e. the dual dimension, so that the bottleneck of the TFETI method becomes the application of the projector onto the natural coarse space, especially its part called coarse problem solution. In this paper, we suggest and test different parallelization strategies of the coarse problem solution regarding to the improvements of the TFETI massively parallel implementation. Simultaneously we discuss some details of our FLLOP (Feti Light Layer on Petsc) implementation and demonstrate its performance on an engineering elastostatic benchmark of the car engine block up to almost 100 million DOFs. The best parallelization strategy based on the MUMPS was implemented into the multi-physical finite element based opensource code ELMER developed by CSC, Finland.
منابع مشابه
Parallel Implementation of Direct Solution Strategies for the Coarse Grid Solvers in 2-level FETI Method
The FETI method is based on introducing Lagrange multipliers along interfaces between subdomain to enforce continuity of local solutions [4]. It has been demonstrated to be numerically scalable in the case of second-order problems, thanks to a built-in “coarse grid” projection [3]. For high-order problems, especially with plate or shell finite element models in structural analysis, a two-level ...
متن کاملDual-Primal FETI Method
The FETI algorithms are numerically scalable iterative domain decomposition methods. These methods are well documented for solving equations arising from the Finite Element discretization of second or fourth order elasticity problems. The one level FETI method equipped with the Dirichlet preconditioner was shown to be numerically scalable for second order elasticity problems while the two level...
متن کاملHybrid MPI/OpenMP parallelization in FETI-DP methods
We present an approach to hybrid MPI/OpenMP parallelization in FETIDP methods using OpenMP with PETSc in the finite element assembly and using the shared memory parallel direct solver Pardiso in the FETI-DP solution phase. Our approach thus uses OpenMP parallelization on subdomains and MPI in between subdomains. We investigate the efficiency of this approach for a benchmark problem from two dim...
متن کاملTowards Extremely Scalable Nonlinear Domain Decomposition Methods for Elliptic Partial Differential Equation
The solution of nonlinear problems, e.g., in material science requires fast and highly scalable parallel solvers. FETI-DP (Finite Element Tearing and Interconnecting) domain decomposition methods are parallel solution methods for implicit problems discretized by finite elements. Recently, nonlinear versions of the well-known FETI-DP methods for linear problems have been introduced. In these met...
متن کاملOn an Adaptive Coarse Space and on Nonlinear Domain Decomposition
We consider two different aspects of FETI-DP domain decomposition methods [8, 23]. In the first part, we describe an adaptive construction of coarse spaces from local eigenvalue problems for the solution of heterogeneous, e.g., multiscale, problems. This strategy of constructing a coarse space is implemented using a deflation approach. In the second part, we introduce new domain decomposition a...
متن کامل